STream3R is a scalable sequential 3D reconstruction model based on causal Transformer, which redefines point cloud map prediction as a decoder-only Transformer problem. It introduces a streaming processing framework, efficiently processes image sequences using causal attention, and can generalize well to various challenging scenarios, including dynamic scenes where traditional methods often fail.
Computer Vision
Safetensors